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ABSTRACT 

Small non-coding RNAs such as miRNAs, piRNAs 
and endo-siRNAs fine-tune gene expression 
through post-transcriptional regulation, modulating 
important processes in development, differenti- 
ation, homeostasis and regeneration. Using deep 
sequencing, we have profiled small non-coding 
RNAs in Hydra magnipapillata and investigated 
changes in small RNA expression pattern during 
head regeneration. Our results reveal a unique rep- 
ertoire of small RNAs in hydra. We have identified 
126 miRNA loci; 123 of these miRNAs are unique to 
hydra. Less than 50% are conserved across two dif- 
ferent strains of Hydra vulgaris tested in this study, 
indicating a highly diverse nature of hydra miRNAs 
in contrast to bilaterian miRNAs. We also identified 
siRNAs derived from precursors with perfect stem- 
loop structure and that arise from inverted repeats. 
piRNAs were the most abundant small RNAs in 
hydra, mapping to transposable elements, the 
annotated transcriptome and unique non-coding 
regions on the genome. piRNAs that map to trans- 
posable elements and the annotated transcriptome 
display a ping-pong signature. Further, we have 
identified several miRNAs and piRNAs whose 
expression is regulated during hydra head regener- 
ation. Our study defines different classes of small 
RNAs in this cnidarian model system, which may 
play a role in orchestrating gene expression essen- 
tial for hydra regeneration. 



INTRODUCTION 

Small RNA-mediated silencing has emerged as an import- 
ant mediator of gene regulation across all organisms, 
regulating diverse functions from defense against genomic 
pathogens in prokaryotes to regulation of self-renewal, dif- 
ferentiation, immune response, cell migration and cell cycle 
in eukaryotes (1-3). Gene regulation by small RNAs is 
mediated through degradation of target mRNAs, suppres- 
sion of translation, DNA methylation, heterochromatin 
formation and programmed genome rearrangement. 
Based on their biogenesis and their associated proteins, 
regulatory small RNAs are classified into three types: 
microRNAs (miRNAs), endogenous silencing RNAs 
(endo-siRNAs) and piwi-associated RNAs (piRNAs) (4). 
miRNAs are 21-23-nucleotide (nt)-long RNAs that arise 
from hair-pin structures and mediate post-transcriptional 
gene regulation through mRNA degradation, translational 
repression and heterochromatin formation (1,4). 
Endo-siRNAs are synthesized through cleavage of long 
double-stranded RNAs, are 21-22 nt long and show 
perfect complementarity to their mRNA targets (5). 
siRNA-mediated silencing is evolutionarily conserved and 
is present in most of the eukaryotes. The least understood 
small RNAs, piRNAs, are 24-30 nt long and are expressed 
in germ cells at different developmental stages in 
Drosophila and mammals (6), whereas in Planarian 
Schmidtea mediterranean they are expressed in neoblasts, 
which are the somatic adult pluripotent stem cells (7). 
piRNAs primarily mediate transposon silencing through 
heterochromatin formation and arise from intergenic re- 
petitive elements. piRNAs also arise from protein coding 
genes such as the maf gene in Drosophila (8,9). In contrast 
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to endo-siRNAs, miRNAs and piRNAs have co-evolved 
with metazoa and are expressed in all multicellular organ- 
isms including basal metazoa like sponges and cnidaria. 
They are also expressed in some, but not all, unicellular 
organisms (10-12). Interestingly, the number of miRNAs 
in an organism increases with increasing tissue complexity 
in metazoa (10), suggesting that miRNAs could have 
contributed to evolution of bilateria from basal metazoa 
by generating a more complex network of gene regulation 
from existing set of genes, for example, by regulating evo- 
lution of tissue identity (13). 

Small RNAs also play an important role in develop- 
ment and regeneration by regulating cell proliferation 
and differentiation. Depletion of dicer, a key enzyme in 
miRNA biosynthesis, causes embryonic lethality in mice 
due to the lack of pluripotent stem cells (14). Depletion of 
DGCR8, another protein involved in miRNA biogenesis, 
in mouse embryonic stem cells (ESCs) leads to accumula- 
tion of ESCs in Gl phase due to a delayed Gl-S phase 
transition (15). This defect in proliferation can be rescued 
by members of miR-290 family, implicating a role for 
miRNAs in Gl-S phase transition in ESCs (16). miRNA 
biogenesis is also essential for caudal fin regeneration in 
zebrafish, where knock-down of dicer leads to regener- 
ation defects and miRNA miR-203 regulates regeneration 
through down-regulation of Lefl, a transcription factor 
required for Wnt-dependent transcription (17). Recent 
studies on Planaria, a triploblastic metazoan with robust 
regeneration capability, also identified several classes of 
small RNAs and their pathway genes involved in stem 
cell function and regeneration (7,18). 

Cnidaria is a sister clade of bilateria that is >500 million 
years old (19,20) and is classified into four distinct classes: 
anthozoa, hydrozoa, cubozoa and scyphozoa (21-23). 
These diploblastic radially symmetrical organisms 
usually alternate between two morphologically distinct 
forms — polyp form and a medusoid form. Unlike other 
diploblasts such as sponges and ctenophores, cnidarians 
have an axis and are organized into tissues. Cnidarians are 
the first multicellular organisms to use positional informa- 
tion for patterning and occupy an important position in 
the evolution of metazoa. Hydra is a fresh water cnidarian 
that exists exclusively in the polyp form and has been used 
as a model system for >250 years (24). Hydra has been 
used extensively to study regenerative biology, develop- 
mental biology and stem cell biology (25-27). Hydra has 
a remarkable ability to regenerate; when cut into pieces, 
each piece of body column, containing as little as 300 cells, 
can regenerate into a complete adult animal while main- 
taining the original polarity (28). It can also regenerate 
from a cluster of experimentally dissociated cells in 
which the axis has been disrupted and undergo de novo 
patterning (29); thus hydra stem cells have retained the 
ability to respond to morphogenetic signals and undergo 
patterning. 

In this study, we profiled small non-coding RNAs in 
Hydra magnipapillata and specifically investigated the 
small RNA profile during head regeneration. Small 
RNAs have been profiled earlier from another cnidarian 
Nematostella vectensis (10). Seventeen miRNAs and few 
piRNA-like RNAs have also been identified in hydra 



during hydra genome sequencing (30). Our preliminary 
studies show that hydra expresses genes essential for 
small RNA biosynthesis such as drosha, two isoforms 
each of dicer, ago and piwi genes and RNA-dependent 
RNA polymerase (RDRP). Hydra appears to express 
functional RNAi machinery, and gene knock-down 
using siRNA or dsRNAs has been performed in hydra 
(31,32). Here, we profile small non-coding RNAs 
in hydra such as miRNAs, siRNAs and piRNAs and 
investigate their role in hydra regeneration. Small 
RNA libraries were generated from regenerating 
H. magnipapillata at different time points during head re- 
generation, and the libraries were subjected to deep 
sequencing and analysis to identify miRNAs, piRNAs 
and endo-siRNAs. We found 126 miRNA loci in hydra, 
of which only 14 have been reported earlier. Interestingly, 
123 of 126 miRNAs are specific to hydra, suggesting either 
a highly diverse nature of hydra miRNAs or an independ- 
ent evolution of hydra miRNA genes. We have also 
identified several small RNAs of size 18-24nt whose pre- 
cursors exhibit perfect stem-loop structures and are 
derived from inverted repeats. Interestingly, the most 
abundant class of small RNAs in hydra is piRNAs, 
which map to transposable elements, the annotated tran- 
scriptome and unique regions of the genome. A majority 
of the piRNAs that uniquely map to the genome lack 
ping-pong signature of piRNA biosynthesis, whereas a 
significant number of piRNAs that map to transposable 
elements and annotated transcriptome have ping-pong 
signature. Further, we also identified several miRNAs 
and piRNAs that are regulated during hydra head regen- 
eration. In summary, our study reports 112 new miRNAs 
in hydra that might have distinct origins and also defines 
several different classes of small RNAs with a potential 
role in head regeneration. 

MATERIALS AND METHODS 

Hydra culture 

All three strains of hydra, H. magnipapillata, H. vulgaris 
AEP and H. vulgaris Ind-Pune, were cultured in hydra 
medium (0.1 mM KC1, ImM CaCl 2 , ImM NaCl, 
0.1 mM MgS0 4 , ImM Tris-Cl, pH 8.0) as described 
earlier at 18°C (33,34). Hydra were fed on alternate days 
with freshly hatched Artemia naupli but were starved 2-3 
days before experiment. For regeneration experiment, 
H. magnipapillata were cut in the mid-gastric region and 
allowed to regenerate, the lower foot region regenerated 
head by 3 days. The foot region regenerating head was 
used for isolating RNA. 

Small RNA library preparation 

Hydra magnipapillata RNA was isolated from uncut 
hydra as control and from polyps regenerating head 
from foot. RNA from H. vulgaris AEP and H. vulgaris 
Ind-Pune was isolated from uncut controls only. RNA 
was isolated from ~100 hydra regenerating head from 
foot region at different times, 3 h, 1 day, 2 day and 3 day 
using Trizol™ as per manufacturer's instructions. Five 
micrograms of total RNA was used to generate small 
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RNA libraries using TruSeq small RNA kit from Illumina 
as per manufacturer's protocol. Briefly specific 5' and 3' 
adapters were ligated to RNA; ligated products were 
reverse-transcribed and amplified by polymerase chain 
reaction (PCR). PCR products were separated using poly- 
acrylamide gel electrophoresis, and products correspond- 
ing to adapter-ligated 18-35-nt-long RNAs (140-160 bp) 
were eluted from the gel. The products were sequenced 
using Illumina HiSeq 1000. The entire experiment was 
performed twice to generate completely independent bio- 
logical duplicates. The small RNA libraries for H. vulgaris 
AEP and H. vulgaris Ind-Pune were generated using 
Illumina Digital Gene Expression Small RNA Sample 
Prep kit (vl.5 protocol). The libraries were sequenced 
using Illumina GAIIx. The depth of sequencing for all 
libraries, whether sequenced using Illumina GAIIx or 
using HiSeq 1000, was comparable. 

Bioinformatic analysis 

About 10-20 million reads were obtained from each 
library, and the sequencing data from all 10 samples 
from H. magnipapillata (five samples each from two bio- 
logical duplicates) were pooled for further analysis. 
To identify true miRNAs, 18-24 mer reads were mapped 
to hydra genome using Bowtie, and the reads mapping to 
hydra genome were further analysed. For each 18-24nt 
read, 100 nt of genomic sequence flanking either side of the 
read was extracted, and its ability to form stem-loop 
structures, which are characteristic of pre-miRNAs, was 
analysed using RNAfold. The reads with stem-loop struc- 
tures were further analysed using miRDeep 2 algorithm to 
search for miRNAs. Possible miRNAs were selected based 
on cut-off of — 10 to 10. These putative miRNAs were then 
manually examined for the presence of 1-2 mismatches in 
the stem and loop sequence of 4-8 nucleotides, and only 
those exhibiting these characteristics were considered to be 
miRNAs. 

To identify miRNAs that were differentially expressed in 
regenerating hydra as compared to the control, the reads 
from each sample were mapped to miRNAs identified by 
miRDeep2 and then were normalized to reads per million, 
to normalize for variations in the number of sequenced 
reads in each sample. We then calculated the correlation 
coefficient for two duplicates of each time point. For each 
time point of regeneration, the correlation coefficient was 
more than 0.95; we therefore did further analysis by taking 
average read numbers of the two duplicates. Averaged 
reads from all regenerates were individually compared to 
the control and expressed as fold-change over control. The 
small RNAs showing at least 2-fold variation for at least 
one time point were taken to be those showing differential 
expression in regeneration. 

To identify piRNAs that map to transposable elements, 
all the reads of size 25-30 nt reads were aligned using 
Bowtie aligner allowing zero mismatches to transposable 
elements obtained from RepBase. piRNAs that map to 
unique regions of the genome were identified by mapping 
25-30 nt reads to genome using Bowtie with option ml, 
which will retain the reads that map only once to the 
genome. To identify the piRNAs that map to annotated 



transcripts, all the reads of size 18-30 nt were aligned to 
annotated transcripts downloaded from NCBI using 
Bowtie aligner allowing no mismatches. The mapped files 
were converted to wiggle tracks, and the wiggle files were 
used as input for MATLAB to obtain density maps. The 
small RNA sequencing data have been deposited at the 
NCBI Sequence Read Archive (SRA) at http://www.ncbi 
.nlm.nih.gov/sra (accession number SRA050926). 

Northern analysis 

RNA was isolated from 2-day starved H. magnipapillata 
using Trizol™ as per manufacturer's instructions. Forty 
micrograms of total RNA was fractionated on 12% 
denaturing polyacrylamide gel and transferred on Nylon 
N + membrane. The blots were UV-cross-linked and baked 
at 80° C for 40 min. The probes were end-labelled with 32 P 
using T4 kinase and purified on G-25 columns. The blots 
were incubated with hybridization solution (0.2 M 
NaH 2 P0 4 and 7% SDS) alone for 2h, followed by 
probe in hybridization solution at 40°C for 16-20h. 
After hybridization, the blots were washed twice with 
2x SSC + 0.1% SDS for 15 min at room temperature 
and exposed to Phosphorimager screen. The signal 
obtained was the same size as that obtained by performing 
Northern analysis of S. mediterranea total RNA using 
miR-71C probe (18), indicating that the signal obtained 
was of the expected size of 21-22 nt (data not shown). 
The probes used for Northern analysis are listed 
in Supplementary Table S6. 

Quantitative RTPCR 

Quantitative PCR was performed as reported for miRNAs 
earlier (35). miRNA-specific primers (MSPs) were 
designed for each miRNA for quantitative RTPCR. The 
primer for reverse transcription was designed with 
double-stranded stem-loop structure with universal 
reverse primer binding site at 5' end and with last eight 
nucleotides at the 3'-end complementary to the 3' -end of 
miRNA. PCR was carried out using miRNA-specific 
forward primers and universal primer with complementar- 
ity to the 'stem' sequence of the MSP. All primers used 
are listed in Supplementary Table S6. There is no suitable 
normalizing control for hydra miRNA PCRs; therefore, 
to normalize the variations that could occur during reverse 
transcription and PCR, all RNA samples were spiked 
with 200 ng of planarian RNA (sexual strain of 
S. mediterranea) before reverse transcription. The reverse 
transcription and PCR for miR-71-al was performed 
along with hydra-specific miRNA PCRs, and the expres- 
sion levels of hydra miRNAs were normalized using 
miR71-al. miR71-al expression was not detected in 
hydra during deep sequencing. Consistent with this obser- 
vation, miR71-al primers did not amplify any product 
when used for reverse transcription and PCR using 
hydra RNA. Similarly, no significant amplification was 
observed when hydra miRNA-specific primers were used 
for reverse transcription and PCR using S. mediterranea 
RNA (data not shown). Reverse transcription was per- 
formed using Superscript III RT (Invitrogen), and the 
quantitative PCR was carried out using SYBR Green 
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(Applied Biosystems) as the fluorescent detector on an 
Applied Biosystems 7900HT machine. Each miRNA was 
analysed for expression profiles at different times of head 
regeneration, and the experiment was performed twice 
using with two sets of RNAs. Quantitative PCR for tran- 
scripts were performed using SYBR Green as the detector 
and actin as a normalizing control, and the probes used 
are listed in Supplementary Table S6. 

In situ hybridization 

Whole mount in situ hybridization was done using locked 
nucleic acid (LNA) miRCURY probes (EXIQON), using 
protocol for in situ hybridization as described previously 
for hydra (36). Probes designed were same as those used in 
Northern hybridization (Supplementary Table S6) except 
that they were tagged with digoxygenin at both 3' and 5' 
ends. Two-day starved hydra were relaxed with 2% ureth- 
ane and fixed overnight with 4% PFA at 4°C. Animals 
were washed five times with 100% methanol followed by 
washes with 75% methanol + 25% PBT (phosphate 
buffered saline + 0.1% Tween-20), 50% methanol + 50% 
PBT, 25% methanol + 75% PBT and then thrice with 
PBT. Polyps were incubated with 5 ug/ml of Proteinase 
K for lOmin, and the reaction was stopped with two 
washes of 4mg/ml glycine. Polyps were then washed 
with thrice with PBT, followed by two washes with 
lOOmM triethanolamine (TEA, pH 7.8). Hydra were 
then washed once with 0.25% acetic anhydride in 
lOOmM TEA (pH = 7.8) and then with 0.5% acetic an- 
hydride. Animals were washed thrice with PBT and refixed 
in 4% PFA. Animals were washed thrice with PBT and 
treated with 0.2 M HC1 for 30min, followed by three 
washes with PBT. Animals were then incubated at 80°C 
for 30 min to inactivate endogenous alkaline phosphatase. 
Animals were washed thrice with PBT and then with 50% 
PBT + 50% Hybridization solution at 58°C for 10 min. 
The hybridization solution contained 70% deionized 
formamide, 5x SSC (diluted from 20 x SSC: 3M sodium 
chloride, 0.3 M sodium citrate, pH 7.0), lx Denhardt's 
Solution (made from 50 x Stock: 1% Ficoll, 1% BSA, 
1% polyvinyl pyrrolidone), 0.1% CHAPS, 0.1% 
Tween-20, Heparin 100 ug/ml and 0.2mg/ml Yeast 
torula RNA. Animals were then washed with 100% 
hybridization solution at 58° C for 10 min. For pre-hybrid- 
ization, animals were incubated with hybridization 
solution for 2h at 58°C. After pre-hybridization, animals 
were incubated with lOnM LNA probe in hybridization 
solution for 36 h at 58°C. Animals were then washed once 
each with hybridization solution, 75% hybridization solu- 
tion +25% 2x SSC, 50% hybridization solution + 50% 
2x SSC, 25% hybridization solution + 75% 2x SSC and 
then with 100% 2x SSC. Animals were incubated with 
20 ug/ml RNase A in 2x SSC + 0.1% CHAPS for 30 
min at 37°C and then washed with 2x SSC + 0.1% 
CHAPS twice at hybridization temperature. This was 
followed by two washes with MAB (100 mM maleic 
acid, 150mM sodium chloride, pH = 7.5) at room tem- 
perature for 10 min and with MAB-B (1% BSA in 
MAB) for 1 h. Animals were incubated with blocking 
solution (80% MAB-B + 20% Heat inactivated Horse 



Serum) at room temperature for 2h and then overnight 
with pre-adsorbed 1:4000 anti-DIG alkaline phosphate 
conjugated antibody (Roche) at 4°C. Animals were then 
washed with MAB eight times and left overnight at 4°C in 
MAB. Animals were washed twice with NTMT (100 mM 
NaCl, lOOmM Tris-Cl pH 9.5, 50mM MgCl 2 + 0.1% 
Tween-20), incubated in 1 mM levamisole for 5 min and 
then incubated with BM-Purple (Roche) at 37°C till 
colour developed. Animals were washed with 100% 
methanol to remove non-specific staining and stored 
in PBS. 

RESULTS 

Deep sequencing of small RNA libraries from 
H. magnipapillata 

Small RNAs in hydra have not been studied systematic- 
ally, and their role in regeneration is entirely unexplored. 
The first genome sequencing report on H. magnipapillata 
identified 17 miRNAs and small RNAs that resembled 
piRNAs (30). In this study, we used next generation 
sequencing to comprehensively identify the small regula- 
tory RNA repertoire in H. magnipapillata as well as to 
identify those small RNAs that are regulated during re- 
generation. Hydra has an enormous ability to regenerate 
after mid-gastric bisection, each piece regenerating into a 
complete functional organism in 3-4 days while maintain- 
ing the original polarity. Within 3h post-bisection, the 
fusion of cut epithelial layers is complete (37), head acti- 
vation begins (38,39) and Wnt3 is expressed, indicating 
initiation of axis formation (40). By 24 h, hypostome for- 
mation is observed, with some regenerates showing devel- 
opment of tentacle buds. By 2 days, tentacles emerge and 
by 3 days almost all hydra show well-formed tentacles 
(Supplementary Figure SI A) (41). In this study, we 
investigated the small RNA expression profile during 
head regeneration. Hydra polyps were cut in mid-gastric 
region, and the RNA was isolated from foot regions 
regenerating heads at 3h, 1 day, 2 days and 3 days. RNA 
was isolated from duplicate regeneration experiments for 
analysis. 

Total RNA isolated from uncut polyps and hydra re- 
generates at different times during head regeneration was 
ligated to adapters, amplified and size-selected for 
products corresponding to RNAs of size 18-35 nt on 
native PAGE. These small RNA libraries were then sub- 
jected to sequencing using an Illumina platform. About 
10-20 million reads were obtained from each library, 
and sequencing data from all samples were pooled for 
further analysis. A total of 131 million reads correspond- 
ing to 18-3 5 nt size were obtained from these sequencing 
reactions. These reads were aligned to the hydra genome 
using Bowtie alignment (42), allowing no mismatch. 
Ninety-seven million reads mapped to the genome and 
of these 15.8% were 18-24 nt reads and 67.2% were 
25-30 nt reads. Small RNA profiling of mapped reads 
revealed two distinct peaks, one at 21-22 nt position, 
corresponding to miRNA and endo-siRNA, and another 
peak at 27-29 nt position corresponding to piRNAs 
(Figure 1A). 
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Intriguingly, we found that only 49.5% of the total 
18-24nt reads mapped to the genome (Figure IB), 
which is less than expected. Potentially this could be due 
to the inadequate (6x) coverage in the published hydra 
genome (30). Inadvertent contamination is unlikely to 
account for this observation, as a high percentage 
(84.9%) of reads representing 25-30 nt sequences 
mapped to the genome (Figure IB). It is unlikely that 
these reads arise from Artemia larvae that are fed to 
hydra, as the polyps were starved for two days before 
isolating RNA. Moreover when the unmapped reads 
(50.5% of total reads) were mapped to Artemia EST 
data set, only 6.2% of these reads mapped with no 
mismatch and 6.8% mapped when two mismatches were 
allowed. These reads also did not map to 
H. magnipapillata symbiont such as Curvibacter (30). But 
8.6% of these unmapped reads did map to bacterial 
genomes that are available at http://www.genome.jp/ 
kegg/, suggesting ~4.3% of the total 18-24 nt reads are 
of bacterial origin, potentially from bacterial species that 
could be endosymbionts in hydra. No significant align- 
ment was observed when these reads were mapped to 
Rfam database (which includes hydra non-coding RNA 
data set), genomic tRNA database at http://gtrnadb.ucsc 
.edu/, or small non-coding RNA database at http:// 
biobases.ibch.poznan.pl/ncRNA/. 

Interestingly, when the unmapped 18-24nt reads were 
re-aligned to the hydra genome allowing one mismatch, 
28% of these reads now mapped to the genome. Further 
increasing mismatches to two during alignment increased 
mapped reads to 58% (data not shown). In alignment 
carried out allowing two mismatches, 28% mismatches 



show A to G change, 13% show T to A change and 
16% show T to C change. These small RNAs also show 
a preference for 'A' and T' at 5 ; -end, suggesting they are 
products of Dicer/Argonaute cleavage rather than degrad- 
ation products (data not shown). The higher percentage of 
A-G mismatch among the small RNA reads (18-24nt) 
suggests the possibility of RNA editing (43). 
Intriguingly, RNA editing alone does not explain the 
presence of mismatches such as T to A and T to C in 
the reads corresponding to 18-24nt size, and reasons for 
these mismatches remain unclear. 

Hydra expresses a unique set of miRNAs 

To identify novel miRNAs, we mapped all 18-24 bp reads 
to the hydra genome and predicted novel miRNAs using 
the program miRDeep2 (44). The miRDeep2 predicted list 
was filtered based on scores >+10 and randfold P- value 
<0.05. Precursor structures obtained after filtering were 
manually curated based on the presence of 1 -2 mismatches 
in the stem region, a loop sequence of 4-8 nt, and the 
presence of mature sequence in the stem region 
(Figure 2A). Output from mirDeep2 consisted of loci 
from hydra genome to which 18-24 mers map allowing 
one mismatch. Using these criteria, miRDeep2 identified 
128 miRNA loci in the H. magnipapillata genome. We 
reanalysed 18-24 mer reads that mapped to these 
miRNA loci, allowing no mismatch, and all 128 miRNA 
loci identified mapped to the genome without any 
mismatch (Supplementary Table SI). Of these 128 loci, 
two potential miRNAs, hma-mir-new2- 1 and hma- 
mir-new2-2, were not considered for further analysis for 
reasons described below. Among the rest 126 loci, we 
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Figure 1. Small RNA profiling of H. magnipapillata. (A) Reads from deep sequencing of all libraries from H. magnipapillata were pooled and size 
distribution of reads was analysed. Reads between 18-35nt show two peaks, one at size 21 nt and another at 28-29 nt. (B) Table shows the total 
number of reads of size 18-24nt and 25-30 nt sequenced and the percentage of these reads mapping to hydra genome. 
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Figure 2. Identification of hydra miRNAs using miRDeep2. (A) Figure shows sequence of a typical miRNA (hma-mir-new46) identified by 
miRDeep2, showing mature strand and star strand and the frequency of the reads obtained for both strands during deep sequencing. (B) Base 
preference for predicted miRNAs. All predicted miRNAs show preference for U at 5'-end indicating processing by dicer. (C) Pie chart depicting 
miRNAs that are common between set of miRNAs identified in this study, hydra miRNAs present in miRBase and N. vectensis miRNAs present in 
miRBase. (D) 18-24nt reads from H. vulgaris Ind-Pune and H. vulgaris AEP were mapped to miRNAs identified in H. magnipapillata allowing no 
mismatch and figure shows miRNAs conserved across these species of hydra. 
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found 112 with both mature and star strand with perfect 
alignment to genome (Supplementary Table S2). In case of 
14 miRNAs, only the mature strand and not star strands 
showing perfect alignment to the hydra genome were 
identified (Supplementary Table S2). 

To test the validity of miRDeep2 program, we checked 
for the presence of previously annotated hydra miRNAs 
from miRBase. We found 14 of 17 miRNAs previously 
predicted (30), underscoring the authenticity of the 
miRDeep2 program for prediction of hydra miRNAs 
(Figure 2C and Supplementary Table SI). The sequences 
corresponding to remaining three miRNAs were present 
in the list of 18-24 bp reads, but miRDeep2 failed to 
predict them as miRNA because of the absence of corres- 
ponding star strands from deep sequencing data. Further, 
we also analysed conservation of hydra miRNA loci in 
other metazoans. Only 3 of 126 miRNAs from hydra 
were conserved in another cnidarian, Nematostella 
vectensis (nve-miR-2022, nve-miR-2029 and nve-miR- 
2030) (10), as shown in Figure 2C, and none of these 
miRNAs were conserved in higher metazoa or in 
sponges. Interestingly, neither the highly conserved 
miRNA miR-100, which is expressed in most metazoa, 
including N. vectensis, nor miRNA let-7, which is 
conserved in bilateria (13), was identified in our 
sequencing analysis, suggesting a unique repertoire of 
hydra miRNAs. 

Most miRNAs range in size from 21-22 nt and show 
strong enrichment for 'U' or 'A' residue at 5'-end, a sig- 
nature for potential Dicer cleavage products (Figure 2B). 
We found eight miRNAs with identical mature strand 
sequence but different precursor sequences 
(Supplementary Table S3). In general, many miRNAs in 
complex metazoans exist in a cluster and are derived from 
individual multi-cistronic transcripts. In hydra, we found 
a single miRNA cluster on contig ABRMO 1025207.1 with 
two miRNA loci (hma-mir-new22 and hma-mir-new63) 
separated by 500 bp (Supplementary Figure SIB and C). 
Interestingly, we also found another pair of miRNA loci 
(hma-mir-new66 and hma-mir-new 1 1) that maps to the 
same genomic region but display an antisense orientation 
to each other on the contig ABRMO 1070 152.1 
(Supplementary Figure SIC). miRNAs are typically 21- 
22 nt in size and can be visualized by Northern hybridiza- 
tion using antisense probes against specific mature 
miRNAs. We analysed 13 of the miRDeep2-predicted 
miRNAs by Northern hybridization. As a control for 
Northern hybridization, we performed Northern hybrid- 
ization with S. mediterranea RNA using miRHC probe as 
described (18). The signal obtained for hydra miRNAs 
was at the same position as miRllC (data not shown), 
confirming the expected size of 21-22 nt in 7 of 13 
miRNAs investigated (Figure 3 A). We further 
performed reverse transcription-polymerase chain 
reaction (RT-PCR) to confirm expression of new 
miRNAs that were identified in this study (35). The 
PCR products obtained were cloned and sequenced to 
confirm expression. We performed RT-PCR analysis on 
13 newly identified miRNAs and were able to confirm 
expression of all 13 miRNAs by sequencing 
(Supplementary Table S2). Interestingly, two highly 



expressed small RNAs, hma-miR-new2-l and 
hma-mir-new 2-2 , with different precursor sequences but 
same mature strand (Supplementary Table SI), could not 
be detected either by Northern hybridization or PCR, and 
hence have been removed from analysis. Interestingly, this 
small RNA was identified in libraries of all three strains of 
hydra sequenced in this study. The number of miRNAs 
reported in this study therefore is 126. We also performed 
in situ hybridization for miRNAs to investigate their lo- 
calization in hydra as shown in Figure 4. We used plan- 
arian lin-4-3p miRNA as a negative control, as it is not 
expressed in hydra. All three miRNAs showed specific 
expression; hma-miR-3005 and hma-miR-2030 were ex- 
pressed throughout the body column including ectoderm 
and endoderm, till the base of tentacles but not in ten- 
tacles, whereas hma-miR-3010 was expressed only in the 
endoderm and at the base of tentacles. 

Conservation of miRNAs across hydra strains 

In general, the miRNA loci are highly conserved across all 
species of an organism. We therefore analysed for the 
sequence conservation of all 238 5p and 3p arms of 126 
miRNA loci across two other strains of hydra, H. vulgaris 
AEP and H. vulgaris Ind-Pune, which are geographically 
separated from each other and from H. magnipapillata. 
H. magnipapillata was originally isolated from Japan 
and has been used for research over many years (30); the 
AEP strain of Hydra vulgaris was isolated from the USA 
(45) and is an important tool in hydra biology owing to its 
ability to readily undergo sexual reproduction. Hydra 
normally reproduces asexually by budding and sexual re- 
production occurs under conditions of stress; the AEP 
strain can be readily induced to undergo sexual reproduc- 
tion and has been used for generating transgenic lines of 
hydra (45). H. vulgaris Ind-Pune strain was isolated from 
Pune, India; it has been used for developmental studies 
and was recently classified as a strain of H. vulgaris 
(34,46). H. magnipapillata and H. vulgaris species are 
closely related and belong to the same monophyletic 
vulgaris group, although H. vulgaris AEP appears to be 
closer to H. carnea rather than rest of the vulgaris 
species (47). 

Small RNA libraries were generated using total RNA 
from Hydra vulgaris AEP and H. vulgaris Ind-Pune and 
were sequenced using the Illumina platform. These reads 
were aligned to H. magnipapillata miRNA database con- 
sisting of 238 5p and 3p arm sequences, allowing no 
mismatches. Surprisingly the number of miRNAs 
conserved between different species of hydra is low, in 
contrast to bilateria where a higher percentage of 
miRNAs are conserved between different species, with 
only 27 miRNAs (5p and 3p) common between all three 
species. Out of 238 sequences, 109 were common in 
H. vulgaris Ind-Pune and H. magnipapillata, whereas 30 
were common between H. magnipapillata and H. vulgaris 
AEP (Figure 2D and Supplementary Table S2). It is also 
possible that miRNAs expressed at very low levels in these 
two strains were perhaps not detected in this study. Our 
observations that H. vulgaris Ind-Pune shares more 
miRNAs with H. magnipapillata than H. vulgaris AEP 
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Figure 3. Northern analysis of miRNAs identified by miRDeep2. (A) Northern analysis was performed on H. magnipapillata RNA using 
radiolabeled probe for mature sequence of miRNAs indicated, along with the stem-loop structure predicted by miRDeep2. There are two pairs 
of miRNAs (hma-miR-3012b-l and hma-miR-3012b-2, hma-miR-3010-1 and hma-miR-3010-2) where two precursors have identical mature sequence. 
Stem-loop structures for both miRNAs from these pairs are depicted in the figure. (B) Figure shows Northern analysis done for endo-siRNA with 
perfect stem-loop structure, gi_195009632_19624. 



can be attributed to the phylogenetic distance reported 
between these strains (47,48). 

We also found several small RNAs that were identified 
as miRNAs by miRDeep2, but unlike miRNAs, they have 



perfect matches in the stem region. These small RNAs are 
highly AU rich and could have been derived from inverted 
repeats. They have l-2nt overhang at 3'-end of the pre- 
cursor and constitute 19.6% of small RNAs identified as 
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Figure 4. In situ hybridization showing expression pattern of hydra miRNAs. (A) In situ hybridization was performed using DIG-labelled locked 
nucleic acid probes for miRNAs hma-miR-2030, hma-miR-3005 and hma-miR-3010. lin-4-3p probe was used as a negative control, as it is not 
expressed in hydra. Inset shows expression pattern in ectoderm and endoderm at higher magnification. (B) Figure shows expression pattern of a 
perfect stem-loop structure gi_195009632_19624 seen after in situ hybridization. 



miRNAs by miRDeep2. We have classified these as 
endo-siRNAs, and they appear to be similar to the 
hairpin RNAs (hpRNAs) in the unicellular algae 
Chlamydomonas (12). Northern analysis for one such 
sequence showed hybridization to a 21 nt species confirm- 
ing it as potential small RNA (Figure 3B). In situ hybrid- 
ization of this small RNA revealed expression throughout 
the body column in both ectoderm and endoderm but not 



in the tentacles, with more expression in budding region 
than in the rest of the body column (Figure 4B). 

Putative piRNAs in hydra 

Our analysis also revealed a distinct class of small RNAs 
that were 27-30 nt in size. These are the most abundant 
class of small RNAs (67.2% of total reads) in hydra 
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(Figure 1A) and might represent potential piRNAs, as 
they are of the same size as piRNAs and they possess 
sequence signatures seen in piRNAs from higher 
metazoa, as described below. Because we do not have 
direct evidence showing the binding of these small 
RNAs to PIWI proteins, we call these small RNAs 
putative piRNAs in this report. piRNAs are typically ex- 
pressed in germ cells in Drosophila and mammals, and are 
important for germ line stem cell function (6). In planarian 
S. mediterranea, they are expressed in neoblasts, the adult 
pluripotent stem cells (7). piRNAs are also expressed in 
diploblastic animals like Nematostella and Amphimedon, 
but the expression pattern of these piRNAs is not known 
(10). In Drosophila, most piRNAs map to transposable 
elements and are responsible for transposon silencing 
(49,50). Drosophila piRNAs also to map to transcripts 
and may down-regulate them (8,9). In mammals, 
piRNAs are mostly expressed during spermatogenesis 
and are broadly divided into two categories: pre-pachy- 
tene piRNAs and pachytene piRNAs. Pre-pachytene 
piRNAs map to transposable elements and are involved 
in silencing of transposable elements (51). Pachytene 
piRNAs are expressed during late meiosis and map to 
unique regions in the mouse genome, but their function 
is not clearly understood (52,53). In the present work, 
small RNA libraries were made from the asexual form 
of hydra, which is devoid of gonads, and therefore 
putative piRNAs thus identified here are mostly present 
in somatic cells. 

In Drosophila, piRNA biogenesis is proposed to follow 
two distinct models, which have been described as 
'ping-pong dependent' and 'ping-pong independent' 
models (54). In the ping-pong model, piRNAs are bidir- 
ectional, with sense and antisense piRNAs showing lOnt 
overlap. Further, sense strands are characterized by the 
presence of 'A' residue at 10th position and antisense 
strands have 'U' at the 5'-end. In ping-pong independent 
model, the sense and antisense strands could be bidirec- 
tional but do not have lOnt overlap. A similar sequence 
analysis in mammals suggests that pre-pachytene piRNAs 
arise in ping-pong-dependent manner, whereas pachytene 
arise in ping-pong-independent manner. In order to find 
out the regions to which putative piRNAs map in hydra, 
we aligned all reads >24nt to transposable elements of 
hydra from RepBase, to hydra genome and to the 
annotated transcriptome. We further analysed these 
putative piRNAs to predict the mechanism of biogenesis 
of these putative piRNAs. Interestingly, we find that in 
hydra 10% of reads map to transposable elements, 
28.4% of reads map to unique regions on hydra genome 
and 13% of reads map to annotated transcriptome. 

In all, 56.7% putative piRNAs that map to transposable 
elements have bidirectional orientation where sense strands 
have strong ping-pong signature with strong preference for 
'A' residue at 10th nucleotide position, suggesting that the 
biogenesis of these putative piRNAs is likely to occur via 
ping-pong model (Figure 5A, left panel of Figure 5B, and 
Supplementary Table S7). The rest of the 43.2% bidirec- 
tional putative piRNAs that map to transposable elements 
do not have 10 nucleotide overlap, and the sense strands 
from these putative piRNAs do not have a preference for A 



at 10th nucleotide (Figure 5B, right panel and 
Supplementary Table S7). These putative piRNAs, 
however, have a strong preference for 'U' residue at the 
5'-end, indicating they are products of processing by 
Argonaute family of proteins, rather than RNA degrad- 
ation products. These sequence signatures suggest their bio- 
genesis could occur via a ping-pong-independent 
mechanism rather than the absence of ping-pong partners 
owing to the lack of sequencing depth (Figure 5B, 
right panel). 

We found 28.4% of 25-30 nt reads mapping to unique 
non-coding regions on the genome, which resemble mam- 
malian pachytene piRNAs. The numbers of reads that 
map to the contig vary from 1000-500 000 reads/contig. 
A total of 96.7% of these putative piRNAs do not have 
lOnt overlap (Figure 5 A) and have strong enrichment for 
'U' at the 5'-end (Figure 5C, top panel and Supplementary 
Table S7), suggesting that they could arise in ping-pong- 
independent manner. With the exception of putative 
piRNAs that map to ABRM0 1000095.1 contig, which 
have bidirectional distribution (Figure 5C, top panel), 
most of the other major clusters (> 100 000) map to 
either of the strands on the contigs (Figure 5C, bottom 
panel). It is, however, difficult to determine the exact 
number of such piRNA clusters on the genome owing to 
incomplete assembly of the reference hydra genome. 

A significant number of small RNAs (13%) mapped to 
the annotated transcriptome. We selected all reads of size 
1 8-30 nt and aligned them to the annotated transcriptome 
from NCBI. We found a strong enrichment for the reads 
of size 27-30 nt compared with 18-24nt (Figure 6A). The 
small fraction (~10%) of 18-24 bp reads that map to tran- 
scripts were evenly distributed across 18-24nt without any 
specific enrichment for a particular size (Figure 6A). 
Further, 18-24nt reads were mostly derived from the 
same regions of the transcript where 27-30 nt reads 
align, suggesting they are potential degradation products 
of 27-30 nt (data not shown). We then filtered the align- 
ment file based on the density of reads that map to the 
transcripts. The density was the ratio of the number of 
reads that map to the transcript per kilobase of the tran- 
script. For further analysis, transcripts with density of at 
least 100 reads of piRNAs mapping per kilobase of tran- 
script were considered. We found 2141 transcripts with 
density of at least 1000, and 2265 transcripts with 
density of 100-999. Reads that map to these transcripts 
are both unidirectional and bidirectional. We found 27% 
of these reads have ping-pong signature (Figure 5A and 
Figure 6B). The remaining 73% of putative piRNAs do 
not have lOnt overlap, but majority of antisense piRNAs 
have strong enrichment for U at the 5'-end, ruling out the 
possibility that these are degradation products (Figure 6C, 
right panel and Supplementary Table S7). However, most 
of the sense piRNAs that do not have lOnt overlap lack 
base residue preferences at both the 5'-end and at lOnt 
position, suggesting they are potential degradation 
products of transcripts and not piRNAs (Figure 6C, left 
panel and Supplementary Table S7). Thus, transcripts that 
have only sense orientation putative piRNAs mapping to 
them are removed from further analysis, and only those 
transcripts that either have putative piRNAs with 
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Figure 5. Putative piRNAs in hydra. (A) Graph showing distribution of ping-pong-dependent and ping-pong-independent putative piRNAs that 
map to repeat elements, unique regions on hydra genome and to transcripts. Putative piRNAs mapping to transcriptome and transposable elements 
show a significant number of piRNAs with ping-pong signature. (B) Base preference for piRNAs that map to transposable elements. The left panel 
shows base preference of sense strand from piRNAs with ping-pong signature (top) which shows preference for 'A' at 10th place and base preference 
for 'LP at the first nucleotide at 5 7 -end in the corresponding antisense piRNAs (bottom). The right panel shows base preference for the piRNAs that 
map to transposable elements but do not have ping-pong signature. The top right panel shows sense strand and the bottom right panel shows 
anti-sense strand. Both strands show preference for 'LP at 5'-end, indicating processing by dicer. (C) Figure shows distribution of piRNAs mapping 
to unique regions on genome. The graph on top left shows base preference for these piRNAs, which show enrichment for 'LP at 5'-end. The panel on 
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Figure 6. Putative piRNAs mapping to transcriptome. (A) Size distribution of small RNAs mapping to transcriptome. The figure shows fraction of 
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bidirectional orientation or antisense orientation are taken 
into consideration. 

Interestingly, among the transcripts to which piRNAs 
map, we find several histone genes. piRNAs on histones 
are distributed all across the length of the transcript. We 
mostly observed sense strand piRNAs mapping to the 
histone transcripts compared with the antisense strand 
(Supplementary Figure S2). We also looked for base 
residue preference and found strong enrichment for U 
residue at 5'-end, suggesting they are AGO/PIWI 
cleavage products. 

Small RNAs in hydra regeneration 

To investigate whether small RNAs uncovered in the 
sequence analysis play a role in regeneration, we 
investigated the regulation of small RNA expression in 



hydra regenerates. Sequencing was performed for two dif- 
ferent sets of RNA isolated from regenerating heads at 
3 h, 1 day, 2 days and 3 days after mid-gastric bisection. 
miRNA reads obtained from each head regenerate were 
first normalized to per million reads, and the correlation 
coefficient between the duplicates was calculated. There 
was a significant (>0.95) correlation between duplicates 
(Supplementary Figure S3), hence further analysis was 
done by averaging the number of reads per million for 
each duplicate. miRNAs with >10 raw reads were con- 
sidered for further analysis for changes in expression 
levels. To investigate changes in the expression levels of 
miRNAs that were altered during regeneration, we 
calculated the fold difference in expression level of each 
miRNA, as compared to the control and identified 
miRNAs that showed at least two-fold difference in 
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expression. Thus, we identified 10 miRNAs that showed 
differences in expression levels at different time points of 
hydra head regeneration; 2 miRNAs were up-regulated 
and 8 were down-regulated during regeneration 
(Figure 7 A and Supplementary Table S4). We further 
validated the changes in expression for three miRNAs, 
hma-mir-newl3-3p, hma-miR-2022-3p and hma-miR- 
new40-3p, by quantitative PCR (Figure 7B) (35). 
hma-miR-2022-3p showed >50% reduction in expression 
at 3 days of regeneration during deep sequencing, as 
compared to the control. Consistent with these observa- 
tions, quantitative PCR showed a 60% reduction in the ex- 
pression level of hma-miR-2022-3p. Similarly, miRNA 
hma-mir-newl3-3p, which showed increased expression at 
3h by deep sequencing showed 80% up-regulation and 
hma-mir-new40-3p showed 4-fold up-regulation at 3h and 
2 days of regeneration by quantitative PCR. 

We performed similar analysis to investigate the changes 
in expression of putative piRNA clusters mapping to tran- 
scripts, genomic contigs and transposable elements. 
Although we did not find significant changes in expression 
of the putative piRNA cluster that map to transposable 
elements and unique regions on the contig, we identified 
165 new putative piRNA clusters that map to various tran- 
scripts and show at least two-fold variation in expression 
levels (Supplementary Table S5). For this analysis, piRNAs 
mapping to 4406 transcripts as mentioned earlier under 'pu- 
tative piRNAs in hydra' were considered. piRNA reads 
were normalized to per million of total reads mapping to 
transcriptome. Out of these filtered reads, reads showing at 
least two-fold difference in expression as compared to the 
control were selected as differentially expressed piRNAs. 
We identified 165 piRNA clusters that show either up-regu- 
lation or down-regulation during different stages of regen- 
eration. Most of the putative piRNA clusters mapping to 
transcripts show decreased expression during hydra head 
regeneration. Interestingly, some of these putative piRNA 
clusters map to the genes encoding histones suggesting 
piRNAs might play a role in up-regulating histone gene 
expression during regeneration (Figure 8A, Supplementary 
Table S5). We also identified 38 putative piRNA clusters 
that map to transcripts that are up-regulated during hydra 
head regeneration. We further investigated whether the 
piRNAs could regulate the transcripts they map to in 
hydra. We performed quantitative PCRs to analyse 
whether the transcripts to which piRNAs map are differen- 
tially expressed. Transcripts for Late Histone H2A2.2, 
Histone H4 replacement and a transmembrane protease 
showed a small, but consistent, up-regulation at the time 
point when the corresponding piRNAs were 
down-regulated (Figure 8B). This suggests that piRNAs 
could potentially regulate expression of hydra transcripts 
during regeneration. Taken together, these results indicate 
a potential role for miRNAs and piRNAs in regulating gene 
expression during hydra regeneration. 

DISCUSSION 

In this study, we report profiling of miRNAs and piRNAs 
from the cnidarian model system hydra by deep 



sequencing. Three new findings emerge from our 
analysis; first, we have identified 112 new miRNAs in 
hydra, adding substantially to the 17 miRNAs reported 
earlier (30). Second, hydra also expresses a number of 
endo-siRNAs which arise from perfect stem-loop struc- 
tures, with l-2nt overhangs at the 3'-end of the precursor. 
Unexpectedly, the most abundant small RNAs in hydra 
are piRNAs that map to both repeat sequences and 
unique genomic regions, including transcripts. Sequence 
analysis of putative piRNAs suggests that these could be 
processed by both ping-pong-dependent and ping-pong- 
independent mechanisms. Finally, we show that several 
miRNAs and piRNAs are regulated during regeneration, 
indicating a role for small RNAs in hydra regeneration. 

Our study has identified several novel miRNAs that 
were not previously identified (30), and this can be 
attributed to the improved depth of sequencing in this 
study. The most striking feature of hydra miRNAs 
identified here is their unique repertoire and diversity. 
Only three hydra miRNAs are conserved in another cni- 
darian, Nematostella, and none of the hydra miRNAs are 
conserved in any higher metazoan or in sponges. 
Surprisingly, hydra does not express two highly conserved 
miRNAs mir-100 and let-7. mir-100 is expressed in all 
bilaterians as well as Nematostella while let-7 is expressed 
in all bilateria (13). A search for sequences of let-7 or 
miR-100 yielded no genomic contigs that could potentially 
encode these miRNAs, suggesting that hydra genome does 
not encode these miRNAs. A similar observation has been 
made in the sponge Amphimedon, where none of the 
miRNAs identified were conserved in higher metazoans 
(10). It is interesting to note that miRNAs in bilateria 
are highly conserved, but those from diploblasts like 
Amphimedon and hydra appear to be unique to each 
organism. Conceivably, miRNA genes have independent 
evolutionary origins in the basal metazoa. Alternatively 
small RNAs in basal metazoa might be highly diverse 
owing to higher rate of evolution. Small RNA profiling 
of sponges and cnidaria like jellyfish, Hydractinia and 
Acropora can further explore this question. 

Within the genus hydra, each species analysed expresses 
a large number of unique miRNAs, with only 11.3% 
miRNAs conserved between all three species analysed 
here. In all, 12.6% of miRNAs present in 
H. magnipapillata were identified in H. vulgaris AEP and 
45.7% were identified in the H. vulgaris Ind-Pune. 
Whether this lack of detection is due to absence of expres- 
sion of miRNAs or due to absence of miRNAs from the 
genome cannot be explored at this stage, as the genomic 
sequences of these two species are not available. Hydra 
miRNAs are perhaps under evolutionary pressure that 
necessitated evolution at a higher rate than higher 
metazoa, and therefore show less conservation across 
species. For example, each species of hydra has specific 
strains of bacteria as symbionts, and this pattern of asso- 
ciation is maintained even after the species are cultured 
under laboratory conditions for several years (55). It is 
possible that the species-specific miRNAs evolved to 
regulate genes involved in response to such commensals, 
symbionts or parasites associated with each species. 
Alternatively, species-specific miRNAs could also target 
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Figure 7. Changes in miRNA expression during head regeneration. (A) Heat map showing regulation of miRNAs during hydra head regeneration. 
The number of reads for each miRNA for each regeneration time point was normalized to per million reads; duplicates for each time point were 
averaged and then expressed as fold change over control expression levels. Log values of fold change to the base 2 were used for generating heat 
map. Only the miRNAs showing correlation coefficient of >0.95 and with at least 10 raw reads were considered for the analysis. (B) Quantitative 
PCR for miRNA hma-miR-2022-3p, hma-mir-newl3-3p and hma-mir40-3p showing regulation of these miRNAs during head regeneration. 
hma-miR-2022-3p is down-regulated 3 days after mid-gastric cut, whereas hma-mir-newl3-3p is up-regulated at 3h after mid-gastric cut and 
hma-mir40-3p is up-regulated 3h and 2 days after mid-gastric cut. 



species-specific genes from these organisms. Several 
genome sequences analysed so far have reported around 
10-20% genes that have no significant similarity to se- 
quences in other organisms and these are termed as 'taxo- 
nomically restricted genes' (TRG) (56). It is possible that 
the species-specific miRNAs identified in this study 
regulate TRGs. Moreover, the hydra genome is itself 
highly dynamic, being rich in transposable elements, 
which contribute to 57% of the genome (30). The 
genome size within hydra species is also variable (57) 
and the genome organization is believed to have been 
influenced by bursts of transposable element expansion 
and horizontal gene transfer. These changes to genome 
organization and acquisition of genes through retrotran- 
sposition could have exerted evolutionary pressure leading 
to species-specific evolution of miRNAs to regulate these 
genes. Alternatively, it is also possible that evolution of 
miRNAs in turn could have led to speciation of hydra. 



miRNAs are often found organized in clusters on 
genome in organisms like Drosophila and vertebrates, 
while the earliest triploblast Planarian S. mediterranea 
has seven such clusters (18). Such clusters of miRNAs 
are believed to be important for coordinated regulation 
of gene expression, and the numbers of miRNA clusters 
in a genome appear to correlate with organismal complex- 
ity. We found one miRNA cluster with two miRNA loci in 
the hydra genome (Supplementary Figure SIB and C). No 
clusters have been reported in Nematostella or 
Amphimedon (10), suggesting that coordinated regulation 
of miRNAs through organized clusters may not be a 
common mechanism of miRNA regulation in diploblasts. 

We have also probed alterations in miRNA expression 
during hydra head regeneration. Ten miRNAs that are 
regulated at least by two-fold at various time points fol- 
lowing bisection were identified, suggesting a role for 
miRNAs in the regulation of gene expression during 
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Figure 8. Changes in piRNA expression during head regeneration. (A) Heat map showing piRNAs that map to transcriptome and are regulated 
during hydra head regeneration. piRNA expression was first normalized to expression per million of total number of 25-30 nt reads that mapped to 
the transcriptome; duplicates for each time point of regeneration were averaged and normalized reads were expressed as fold change over control. 
Further only the piRNAs with >100 reads per kilobase of transcript were considered to check for changes in expression levels. (B) Quantitative PCR 
for transcripts to which piRNAs map and are differentially regulated during regeneration. Down-regulation in piRNA levels during regeneration is 
expected to lead to increased transcript levels. The figure shows increase in transcript levels of Late Histone H2A2.2, Histone H4 replacement and a 
transmembrane protease at time points when corresponding piRNAs are down-regulated as shown in Supplementary Table S5. Each qPCR was done 
with two independent biological replicates, with each PCR done in triplicate for each time point. 
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regeneration. Most of these miRNAs (8 of 10) were 
down-regulated during head regeneration, indicating a po- 
tential role in suppression of genes that play a role in head 
formation and/or tentacle development. Interestingly, we 
observe that although at 3 days the regenerating hydra 
morphologically resembles a complete organism, none of 
the miRNAs that are regulated return to expression levels 
seen in unperturbed control organisms, suggesting that the 
process of regeneration is not complete at this time. Eight 
of these 10 miRNAs that were regulated during head 
regeneration are also conserved in H. vulgaris Ind-Pune. 
Interestingly we found only 2 out of 10 regulated miRNAs 
conserved in H. vulgaris AEP strain (Supplementary Table 
S4). Conceivably, the AEP strain, which is more closely 
related to H. carnea than other vulgaris species, might 
have evolved a set of novel miRNAs for regulation of 
regeneration. 

As in Amphimedon and Nematostella (10), we find that 
piRNA-like small RNAs represent the most abundant 
class of small RNAs in hydra. These putative piRNA se- 
quences were derived from the asexual form of hydra, 
suggesting they are not expressed in germ cells as in 
mammals, but as in Planarian S. mediterranean may be 
expressed in stem cells (7). A transcriptome sequencing 
of different cell lineages of hydra shows that all three 
cell lineages in hydra, ectodermal cells, endodermal cells 
and interstitial cells, express piwi transcripts (58), which 
indicates that putative piRNAs could be present in the 
stem cells of all three cell lineages. In hydra, putative 
piRNAs mapped to both repeat elements as well as 
unique loci in the genome. A total of 57.2% putative 
piRNAs that map to repeat elements showed a signature 
of ping-pong-dependent biogenesis, confirming the earlier 
observations that this pathway of piRNA biogenesis 
evolved early (10). We also identified a large number of 
putative piRNAs without a ping-pong signature. 
At present, we cannot rule out the possibility that these 
are direct cleavage products of Argonaute proteins rather 
than PIWI proteins, which can only be resolved by 
knocking down the argonaute and piwi genes. We specu- 
late that the putative piRNAs mapping to transposable 
elements would be involved in transposon silencing 
either via transposon degradation or heterochromatin for- 
mation, thus maintaining genome integrity. However, 
only 10% of putative piRNAs map to repeat elements 
and a larger number maps to unique regions on the 
genome, suggesting piRNAs in hydra may play a wider 
role than regulation of transposable elements. These 
putative piRNAs resemble pachytene piRNAs of 
mammals in terms of their genome organization. We 
found at least 12 clusters of piRNAs with > 100 000 
reads that map to different contigs. In the absence of 
genome assembly, it is difficult to predict the exact 
number of piRNA clusters and their location. In 
mammals, pachytene piRNA clusters are important for 
maintenance of germ line integrity (6) and the presence 
of similar piRNA clusters in an asexual strain of hydra 
makes it interesting to study their role in hydra biolog\y. 

We identified several putative piRNA clusters that map 
to transcripts, suggesting that piRNAs may be important 
for gene regulation in hydra. A total of 27% of these 



piRNAs have ping-pong signature. The remaining 73% 
could either be associated with the ping-pong-independ- 
ent pathway or could be Argonaute-processed small 
RNAs. These putative piRNA clusters could potentially 
regulate gene expression. Indeed, we find that a small, but 
significant, number of piRNAs that map to 165 transcripts 
are either up-regulated or down-regulated during head 
regeneration. Interestingly, some of these transcripts 
where piRNA clusters were down-regulated include 
various histone genes such as H2B, HI and late histones 
H2A. Some of these genes such as Histone H2A and 
Histone H4 replacement show increase in expression 
levels at the time point where their corresponding 
piRNAs are down-regulated, indicating that these 
putative piRNAs could indeed regulate transcript levels. 
Recently, it has been shown that specific variants of 
histone HI are expressed in human embryonic cells and 
induced pluripotent cells but not in differentiated cells 
(59). It is possible that a similar regulation of histone 
genes could be important for hydra regeneration. 
In Drosophila, piRNAs map to the 3 / -UTR region of 
maf transcripts raising the possibility of regulation of 
expression of maf loci by piRNAs (8,9). piRNAs derived 
from the intronic region of the human melatonin receptor 
1A (mtnrla) also negatively regulate mtnrla gene expres- 
sion by binding to the genomic region (60). These isolated 
studies show that in addition to transposon regulation, 
piRNAs are likely to play alternate roles in regulating 
gene expression. It is interesting to note that putative 
piRNAs in hydra appear to regulate diverse genomic 
elements and that their role is not limited to regulation 
of gametogenesis as in higher metazoa. Similar to piRNAs 
in S. mediterranean piRNAs in hydra could be important 
for adult stem cell function. It is possible that piRNAs 
that arose early in evolution played a wider role and 
their role eventually became more specialized for regula- 
tion of transposable elements in germ cells in higher 
organisms. 

In summary, our study has identified a large number of 
novel miRNAs and piRNA-like small RNAs, some of 
which are regulated during head regeneration in 
H. magnipapillata. These findings serve as the platform 
from which to address more challenging task of defining 
the targets of miRNAs, expression of piRNAs in different 
cell types, biogenesis of piRNA-like small RNAs and 
functional roles played by these small RNAs in the 
unique biology of hydra. 
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